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Question 1 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) describe the distribution of a 
quantitative variable based on a histogram and (2) determine the effect of changing one data value on the 
mean and the median. 


Solution 
Part (a): 


The distribution of Robin's tip amounts is skewed to the right. There is a gap between the largest tip 
amount (in the $20 to $22.50 interval) and the second largest tip amount (in the $12.50 to $15 interval), 
and the largest tip amount appears to be an outlier. The median tip amount is between $2.50 and 
$5.00. Robin's tip amounts vary from a minimum of between $0 and $2.50 to a maximum of between 
$20.00 and $22.50. About 78 percent of the tip amounts are between $0 and $5. 


Part (b): 


The mean: If the $8 tip had been $18, the mean would increase by $10 divided by 60, or $ z, or about 


17 cents. 


The median: If the $8 tip had been $18, the median would not change because the current median is 
between $2.50 and $5.00, and both $8 and $18 are greater than that. 


Scoring 


Parts (a) and (b) are scored as essentially correct (E), partially correct (P), or incorrect (1). 
Part (a) is scored as follows: 


Essentially correct (E) if the response includes reasonable comments on the following five components: 

1. Shape (skewed right) 

2. Outlier (at least one) OR gap (one tip amount greater than $20, next highest at most $15) 

3. Center between $2.50 and $5.00 (median) or between $2.62 and $5.13 (mean) 

4. Variability, by noting that the tip amounts vary from about $0 to at most $22.50, or that a 
majority of tip amounts are between $0 and a value greater than or equal to $5, or by providing 
a correct numerical approximation of a measure of variability 

5. Context (tip amounts) 


Partially correct (P) if the response includes only three or four of the five components. 


Incorrect (I) if the response includes at most two of the five components. 


© 2016 The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


AP® STATISTICS 
2016 SCORING GUIDELINES 


Question 1 (continued) 
Part (b) is scored as follows: 
Essentially correct (E) if the response includes the following four components: 
1. Comments that the mean will increase 
2. Correctly justifies why the mean will increase 
3. Comments that the median will not change 
4. Correctly justifies why the median will not change 
Partially correct (P) if the response includes only two or three of the four components. 
Incorrect (I) if the response includes at most one of the four components. 
4 Complete Response 
Both parts essentially correct 
3 Substantial Response 
One part essentially correct and one part partially correct 
2 Developing Response 
One part essentially correct and one part incorrect 
OR 
Both parts partially correct 


1 Minimal Response 


One part partially correct and one part incorrect 
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Ouestion 2 


Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) identify, set up, perform, and 
interpret the results of an appropriate hypothesis test to address a particular question and (2) assess the 
effectiveness of treatments in a controlled experiment. 


Solution 
Part (a): 
step 1: States a correct pair of hypotheses. 


H, : The proportion of children who would choose each snack is the same regardless of which type of 
ad is viewed. 

H, : The proportion of children who would choose each snack differs based on which type of ad is 
viewed. 


Step 2: Identifies a correct test procedure (by name or formula) and checks appropriate conditions. 
The appropriate procedure is a chi-square test of homogeneity. 
The conditions for this test are satisfied because (1) the question states that the children were 


randomly assigned to groups, and (2) expected counts for the six cells of the table are all at least 5, as 
seen in the following table that lists expected counts beside observed counts. 


Apple-Zuties 
A 21 (18.67 
il 


Po A 21 (18.67) 4 (6.33) 
Po UBT 13(18.67) | 126.33) | 
rc. 22 (18.67) 3 (6.33) 





5 
5 
C : 


step 3: Calculates the appropriate test statistic and p-value. 


ae 
The test statistic is calculated as x’ = ye which is 


Zz 
a 


0.292 + 0.860 + 
1.720 + 5.070 + 
0.595 + 1.754 = 10.291. 


The p-value is P(y“a¢5 = 10.291) ~ 0.006. 


step 4: States a correct conclusion in the context of the study, using the result of the statistical test. 


Because the p-value is very small (for instance, much smaller than a = 0.05), we reject the null 
hypothesis at the 0.05 level (and at the 0.01 level). The data provide convincing statistical evidence 
that the proportions who would choose each snack differ based on which ad is viewed. 
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Question 2 (continued) 


Part (b): 


When neither ad was viewed, a or 88 percent of the children chose Choco-Zuties whereas only 


12 percent chose Apple-Zuties. 


When the Choco-Zuties ad was viewed, 84 percent of the children chose Choco-Zuties, which was 
very similar to the percentage that chose them without viewing any ad. So watching the Choco-Zuties 
ad did not affect the snack choice very much. 


When the Apple-Zuties ad was viewed, only = or 52 percent of the children chose Choco-Zuties, and 


48 percent chose Apple-Zuties. Watching the Apple-Zuties ad seemed to increase the proportion of 
children choosing Apple-Zuties. 


Scoring 


This question is scored in four sections. Section 1 consists of steps 1 and 2 in part (a): section 2 consists of 
step 3 in part (a); section 3 consists of step 4 in part (a); and section 4 consists of part (b). Sections 1, 2, 3 
and 4 are scored as essentially correct (E), partially correct (P), or incorrect (I). 


Section 1 is scored as follows: 


Essentially correct (E) if the response includes the following three components: 
1. Both hypotheses are stated correctly with at least one in context. 
2. Identifies the correct test procedure (by name or by formula). 
3. The technical conditions are checked (all expected counts are greater than or equal to 5). 


Partially correct (P) if the response includes only two of the three components. 
Incorrect (I) if the response correctly includes at most one of the three components. 


Notes for component 1: 


e Itis acceptable if hypotheses are stated in terms of association/independence instead of 
proportions. 


e Context must be included in at least one of the hypotheses, but is not required to be in both. 


e Ifthe hypotheses contain language that suggests that the response refers to the sample data, 
the component is not satisfied. 


Note for component 2: It is acceptable if the test is identified as a test of independence instead of a test 
of homogeneity. 


Notes for component 3: 
e ‘The random assignment condition was stated so need not be explicitly mentioned. 
e Stating the expected count condition is met is not sufficient for component 3. The condition 
must be checked by reporting the expected counts and either: 
Oo noting that all are greater than or equal to 5; 
OR 
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Question 2 (continued) 


Oo noting that all are greater than or equal to 1 and at most 20 percent are less than 5. 
e Noting that the smallest expected count is 6.33 and that it is greater than or equal to 5 is 
sufficient to satisfy this component. 
e Component 3 is not satisfied if the expected counts are reported as integers. 
e ‘This component is not satisfied if the response includes any of the following inappropriate 
conditions: 
o ‘The response implies that a random sample was taken, e.g., “SRS — check.” 
o ‘The response refers to independence of groups or independence of ads as a required 
condition. 
o ‘The response indicates that a sample size greater than 30 ensures normality or the 
response implies normality as a condition. 
e A response stating that children are independent can be ignored in the scoring of this 
component. 


Section 2 is scored as follows: 


Essentially correct (E) if the response includes the following three components: 
1. Correct chi-square test-statistic 
2. Correct degrees of freedom 
3. Correct p-value 


Partially correct (P) if the response includes only two of the three components. 
Incorrect (I) if the response correctly includes at most one of the three components. 


Notes: 
e Ifthe response makes an error in one calculation, future calculations are considered correct if 
they follow correctly from the initial miscalculation. 


e Achi-square critical value approach is acceptable: The critical value for a = 0.05 is x? = 5.99. 


Section 3 is scored as follows: 


Essentially correct (E) if the response provides a correct conclusion about the alternative hypothesis in 
context, AND provides justification based on linkage between the p-value and conclusion. 


Partially correct (P) if the response provides a correct conclusion in context, but without justification 
based on linkage to the p-value; 

OR 
if the response provides a correct conclusion, with linkage to the p-value, but not in context; 

OR 
if the response provides a correct decision stated in terms of the null hypothesis in context, with 
linkage to the p-value, but no conclusion is made about the alternative hypothesis. 


Incorrect (I) if the response does not meet the criteria for E or P. 
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Question 2 (continued) 


Notes: 

e Ifthe conclusion is consistent with the p-value from section 2, and also in context with 
justification based on linkage to the p-value, section 3 is scored as E. 

e Ifno alpha level is given, the solution must be explicit about the linkage by giving a correct 
interpretation of the p-value or by explaining how the conclusion follows from the p-value such 
as saying: “Because the p-value is small, we reject the null hypothesis.” or “Because the p-value 
is large, we do not reject the null hypothesis.” 

e Ifaconclusion contains language that suggests that the response refers to the sample data, the 
conclusion component is not correct, unless the same error occurred in the statement of 
hypotheses in section 1. 


Section 4 is scored as follows: 


Essentially correct (E) if the response, in context, concludes that ad type A had little effect and ad type 
B had an effect, both supported by the observed proportions (or counts) from the table. 


Partially correct (P) if the response correctly concludes that ad type A had little effect, and ad type B 
had an effect but does not provide correct numerical justification: 

OR 
if the response compares all of the proportions (or counts) as required, but without correctly concluding 
ad effectiveness: 

OR 
if the response correctly describes the effect of one of the ad types A or B (in context, with correct 
numerical justification) but not the other: 

OR 
if the response compares all of the proportions (or counts) as required, but not in context. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Each essentially correct (E) section counts as 1 point, and a partially correct (P) section counts as % point. 


4 Complete Response 

3 Substantial Response 
2 Developing Response 
1 Minimal Response 


If a response is between two scores (for example, 2% points), use a holistic approach to decide whether to 
score up or down, depending on the strength of the response and communication. 
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Question 3 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) identify explanatory and 
response variables from the description of a research study; (2) indicate and justify whether a study is 
observational or experimental; and (3) explain what confounding means in the context of a particular study 
with a specific confounding variable. 


Solution 
Part (a): 


The explanatory variable is the person's degree of cigarette smoking. ‘The response variable is whether 
the person develops Alzheimer’s disease during the course of the study. 


Part (b): 


This is an observational study because the people in the study were not assigned to a certain degree of 
cigarette smoking. Rather, the degree of cigarette smoking for each person was passively observed and 
recorded, not manipulated by the researchers. 


Part (c): 


A confounding variable is one that is related to the explanatory variable and possibly influences the 
response variable. In this case it seems plausible that people who exercise more regularly might be 
more health conscious, therefore, less likely to smoke cigarettes than people who do not exercise 
regularly. Similarly, it’s possible that people who exercise more regularly are less likely to develop 
Alzheimer's disease than people who do not exercise regularly. If both of these relationships turn out to 
be true, then smoking cigarettes would be associated with developing Alzheimer’s disease due to the 
association of both variables with exercise, even if there were no cause-and-effect relationship 
between smoking cigarettes and developing Alzheimer's disease. 


Scoring 
Parts (a), (b), and (c) are scored as essentially correct (E), partially correct (P), or incorrect (I). 
Part (a) is scored as follows: 


Essentially correct (E) if both variables are described correctly. A correct description includes some 
degree of status of the variables, such as smoking versus not smoking and developing Alzeimer’s 
versus not developing Alzheimer's. 


Partially correct (P) if one variable is described correctly and one is not described correctly: 
OR 
if neither variable is described correctly but smoking is mentioned as the explanatory variable, and 
Alzheimer’s is mentioned as the response variable; 
OR 
if the explanatory and response variables are correctly described but are interchanged. 


Incorrect (I) if the response does not meet the criteria for E or P. 
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Question 3 (continued) 


Note: Describing the variables only as smoking and Alzheimer’s without a reference to levels/status is 
not sufficient for E. But making the connection between smoking and explanatory and between 
Alzheimer’s and response is sufficient for P. 


Part (b) is scored as follows: 


Essentially correct (E) if the response selects the correct type of study (observational) and provides the 
correct explanation that smoking status was not assigned OR that smoking status was only observed. 


Partially correct (P) if the response selects the correct type of study (observational) and provides a 
correct explanation but does not refer to smoking status as the variable that would have been assigned 
had it been an experiment; for instance, by simply stating that treatment was not assigned; 

OR 
if the response selects the correct type of study (observational) and provides an explanation that says 
that smoking is observed (or that smoking status and Alzheimer’s are observed) without a modifier for 
observed (such as, only, just, merely, simply) AND without indicating that treatments were not 
assigned. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 

e Aresponse that states the explanatory variable was not assigned without naming smoking 
status is sufficient for E if the explanatory variable is correctly defined in part (a). 

e A response that states that smoking status is ONLY observed (or that smoking status and 
Alzheimer’s are ONLY observed) is sufficient for E. 

e A response that says that Alzheimer’s is observed without mentioning smoking status is 
scored I. 

e Aresponse that provides an incorrect statistical explanation (such as, “the study is 
observational because an experiment must have a control group”) lowers the score in part (b) 
by one level (from E to P or from P to J). 

e If Alzheimer’s is given as the explanatory variable and smoking is given as the response 
variable in part (a), then part (b) should be scored accordingly with the two variables 
interchanged. 

e Any phrase that refers to the “effect of smoking on Alzheimer’s” or “the association between 
smoking and Alzheimer’s” (rather than smoking status) should be ignored. 

Part (c) is scored as follows: 


Essentially correct (E) if the response includes the following two components: 
1. Provides a reasonable explanation that exercise status is related to smoking status. 
2. states that exercise status might influence whether the person develops Alzheimer’s disease. 


Partially correct (P) if the response describes only one of the two components; 

OR 
if the response only describes how smoking and exercise jointly influence whether the person develops 
Alzheimer's: 

OR 
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Question 3 (continued) 


if the response mentions both of the associations in each component, but also includes an incorrect 
statistical statement. For instance, part (a) is correct, but in part (c), the explanatory and response 
variables are interchanged. 


Incorrect (I) if the response does not meet the criteria for E or P. 


4 


Complete Response 

Three parts essentially correct 
Substantial Response 

‘T'wo parts essentially correct and one part partially correct 
Developing Response 

Two parts essentially correct and no parts partially correct 
OR 

One part essentially correct and one or two parts partially correct 
OR 

Three parts partially correct 
Minimal Response 

One part essentially correct 


OR 
No parts essentially correct and two parts partially correct 
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Question 4 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) calculate a probability using 
basic probability rules or the geometric distribution; (2) recognize that a probability calculation for 
independent events does not depend on the previous outcomes of those events; and (3) assess whether a 
claim about the probability of a single event is reasonable based on a calculated probability of a series of 
those events. 


Solution 
Part (a): 


If the failure rate for the super igniters is 15 percent, then the probability that each igniter fails is 0.15, 
and the probability that it does not fail is 0.85. Therefore the probability that the first 30 igniters tested 


do not fail is (0.85)°" ~ 0.0076. The solution can also be written as (1 — 0.15)" ~ 0.0076. 
Part (b): 


Given that there are no failures in the first 30 trials, the probability that the first failure occurs on the 
31* trial is 0.15, and the probability that it does not occur on the 31* but occurs on the 32" trial is 
(0.85)(0.15) = 0.1275. Therefore the probability that the first failure occurs on the 31 or 32"° super 


igniter tested is 0.15 + 0.1275 = 0.2775. 


Note that this is equivalent to asking for the probability that the first failure occurs on the first or 
second trial, which is 0.15 + (0.85)(0.15) = 0.2775. 


Part (c): 
The result of the probability calculation in part (a) provides a reason to believe that the failure rate of 
the super igniters is less than 15 percent. The calculated probability of 0.0076 shows that there is less 
than a 1 percent chance that 30 or more igniters in a row would not fail if the failure rate was 
15 percent. This probability is smaller than conventional significance levels such as a = 0.05 or 


a = 0.01, and thus is small enough to make it reasonable to believe that the failure rate of the super 
igniters is less than 15 percent. 


scoring 
Parts (a), (bo), and (c) are scored as essentially correct (E), partially correct (P), or incorrect (1). 
Part (a) is scored as follows: 
Essentially correct (E) if the response gives the correct probability AND correct justification. 
Partially correct (P) if the response correctly notes that the answer is the probability that there will be 


30 successes in 30 attempts, but does not carry out a correct probability calculation; 
OR 
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Question 4 (continued) 


if the response defines the random variable X as the trial with the first failure, identifies X as having a 
geometric distribution with p = 0.15, and writes the desired probability as P(X > 30), but does not 
carry out a correct probability calculation: 

OR 
if the response defines the random variable X as the number of failures in the first 30 attempts, 
identifies X as a binomial random variable with p = 0.15 and n = 30, and writes the desired 
probability as P(X = 0), but does not carry out a correct probability calculation: 

OR 
if the response gives the correct probability but, in specifying a geometric or binomial distribution, has 
an incorrect or incomplete definition of parameters or value(s) of the random variable. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Note: Justification can be given using the multiplication rule; OR by defining X to be the trial with the 
first failure, recognizing that X has a geometric distribution, and using that information to find 

P(X > 30): OR by defining X to be the number of failures in the first 30 attempts, and then finding 
P(X = QO) using either probability rules or the binomial distribution with n = 30 and p = 0.15. 


Part (b) is scored as follows: 
Essentially correct (E) if the response gives the correct probability AND correct justification. 


Partially correct (P) if the response makes a reasonable attempt to calculate a geometric, binomial, or 
conditional probability, but does not successfully carry out the calculation: 

OR 
if the response gives the correct probability but, in specifying a geometric or binomial distribution, has 
an incorrect or incomplete definition of parameters or value(s) of the random variable. 


Incorrect (I) if the response finds an incorrect probability resulting from an unreasonable attempt to 
calculate a geometric, binomial, or conditional probability or otherwise does not meet the criteria for E 
or P. 


Note: Similar to part (a) justification can be given using probability rules; OR by stating that X is 
geometric where X is the trial with the first failure, then finding P(X = 1 or X = 2): OR by stating that 
X is the number of failures in two trials and finding 1—- P(X = O)or P(X =1o0rX = 2) using the 
binomial distribution. 


Part (c) is scored as follows: 


Essentially correct (E) if the response states that it is reasonable to believe that the failure rate is less 
than 15 percent AND bases this decision on the fact that the probability of 30 consecutive successful 
launches with a failure rate of 15 percent (that is, answer from part (a)) is small AND does so in the 
context of the situation. 


Partially correct (P) if the response otherwise satisfies the criteria for an (E) but does so without any 
context; 
OR 
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Question 4 (continued) 


if the response states a significance level and makes a decision in a context that is appropriate to the 
given probability in part (a) and the stated significance level but does not explicitly compare the 
probability and the significance level (no linkage). 


Incorrect (I) if the response does not explicitly make a decision about whether it is reasonable to 
conclude that the failure rate is less than 15 percent (For example: “As seen in Part (a), if the failure rate 
is 15 percent then the probability of 30 successful launches in a row is very small.”): 

OR 
if the response otherwise does not meet the criteria for E or P. 


Notes: 


e Justification based on the probability can come by stating a significance level and noting that 
the probability is smaller than the significance level OR by simply stating that the probability of 
0.0076 is small OR by referring to the expected number of failures (4.5) as being very unlikely 
because zero failures is more than two standard deviations below 4.5. 


e Ifthe response bases the decision on the expected number of failures (4.5) for n = 30 and 
p = 0.15 without referencing why zero failures would be considered to be too far below 4.5 to 
give reason to doubt the stated 15 percent failure rate, the response is scored P. 
e Ifthe calculation in part (a) is incorrect, the answer in part (c) needs to be consistent with the 
answer in part (a), unless the value is recalculated in part (c). 
4 Complete Response 
Three parts essentially correct 
3 Substantial Response 
Two parts essentially correct and one part partially correct 


2 Developing Response 


Two parts essentially correct and no parts partially correct 


OR 
One part essentially correct and one or two parts partially correct 
OR 
Three parts partially correct 
1 Minimal Response 
One part essentially correct 
OR 


No parts essentially correct and two parts partially correct 
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Question 5 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) construct and interpret a 
confidence interval for a population proportion: (2) explain why one of the conditions for inference is 
necessary; and (3) explain why a suggested procedure for constructing a confidence interval is incorrect. 


Solution 
Part (a): 


The appropriate procedure is a one-sample Z-interval for a population proportion. The problem stated 
the conditions for inference have been met, so they do not need to be checked. A 95 percent 


confidence interval for the population proportion is given as p + Zz Alp which is 


(0.37)(0.63) 


+ 
30a 196 1.048 


= 0.37 + 0.03 = (0.34,0.40). 


We are 95 percent confident that the population proportion of all adults in the U.S. who would have 
chosen the economy statement is between 0.34 and 0.40. 


Part (b): 


The condition is necessary because the formula for the confidence interval relies on the fact that the 
binomial distribution can be approximated by a normal distribution which then results in the sampling 
distribution of p being approximately normal. The approximation does not work well unless both np 


and n(1— p) are at least 10. 
Part (c): 


The suggested procedure is not appropriate because one of the requirements for using a two-sample 
z-interval for a difference between proportions is that the two proportions are based on two 
independent samples. In the situation described the two proportions come from a single sample and 
thus are not independent. 


Scoring 


This question is scored in three sections. Section 1 consists of the mechanics of the confidence interval in 
part (a), section 2 consists of interpreting the confidence interval in part (a), and section 3 consists of parts 
(lb) and (c). 


Section 1 is scored as follows: 
Essentially correct (E) if the response includes the following two components: 
1. States the correct procedure by name or formula 
2. Calculates the confidence interval 


Partially correct (P) if the response includes only one of the two components. 


Incorrect (I) if the response does not meet the criteria for E or P. 
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Question 5 (continued) 


Notes: 
e A formula with correct values is sufficient for component 2. 
e Component 2 can never be satisfied if the response contains an incorrect formula. 


Section 2 is scored as follows: 


Essentially correct (E) if the response includes the following four components for the interval 
interpretation: 

1. Estimates a proportion 

2. Infers about the population 

3. States 95 percent confidence 

4. Includes context 


Partially correct (P) if the response satisfies components 1 and 2 AND satisfies only one of components 
3 or 4: 
OR 


if the response gives a correct interpretation of the confidence /evel in context without interpreting the 
specific interval. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 


e Any indication of an inference to the adults sampled rather than the population does not satisfy 
component 2 and is scored as I. 


e Stating values that are unrealistic as proportions (including blanks) in the interpretation lowers 
the score by one level (from E to P or from P to J). 


e When both the interpretation of the interval and the level are given, only the interpretation of 
the interval is scored. 


Section 3 is scored as follows: 


Essentially correct (E) if the response includes the following two components: 
1. Part (b) states that the condition implies the sampling distribution of p is approximately 
normal OR the normal approximation to the binomial distribution is appropriate. 
2. Part (c) indicates the procedure is not appropriate because the two proportions come from a 
single sample (dependent) rather than two (independent) samples. 


Partially correct (P) if the response includes only one of the two components. 
Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 
e Referring to the sampling distribution needing to be normal without explicitly stating p 
satisfies component 1. 
e Referring to normal approximation must specify “to binomial” to satisfy component 1. 


e Discussing the two-sample z-interval versus the two-proportion z-interval is extraneous 
information. 
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Question 5 (continued) 


In part (c) component 2 can never be satisfied if a procedure that is not possible for one sample 
of categorical data is suggested. 


Complete Response 

Three parts essentially correct 
Substantial Response 

Two parts essentially correct and one part partially correct 
Developing Response 

Two parts essentially correct and no parts partially correct 
OR 

One part essentially correct and one or two parts partially correct 
OR 


Three parts partially correct 


Minimal Response 


One part essentially correct 
OR 
No parts essentially correct and two parts partially correct 
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Question 6 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) use a scatterplot to comment on 
a report about the relationship between two variables and interpret the slope for the least-squares 
regression line summarizing this relationship; (2) describe the relationship between two variables in a 
scatterplot when a categorical variable is introduced and compare a characteristic of the distribution of a 
variable for different categories of individuals in a scatterplot; and (3) describe how the associations 
between two variables for each category of individuals in a scatterplot differ from the overall association in 
the same scatterplot. 


Solution 
Part (a): 


The scatterplot supports the newspaper report about number of semesters needed to complete an 
academic program and starting salary because it shows a positive association between these two 
variables. 


Part (b): 


The slope is 1.1594. For each additional semester needed to complete an academic program, the 
predicted starting salary increases by €1,159.40. 


Part (c): 


For the business majors alone, there is a strong, negative, linear association between number of 
semesters and starting salary. Business majors who need a greater number of semesters to complete 
an academic program tend to have lower starting salaries. 


Part (d): 


Business majors have the lowest median starting salary at around €38,000, followed by physics majors 
at around €48,000, and then chemistry majors with the highest median starting salary at around 
£95,000. 


Part (e): 


The newspaper report should be modified to account for major. Overall, majors that take longer to 
complete tend to have higher starting salaries, with chemistry the highest, physics the next highest, 
and business the lowest. However, within a major, students who take a greater number of semesters 
tend to have lower starting salaries. 


Scoring 


This question is scored in three sections. Section 1 consists of parts (a) and (b), section 2 consists of parts 
(c) and (d), and section 3 consists of part (e). Sections 1, 2, and 3 are scored as essentially correct (E), 
partially correct (P), or incorrect (I). 
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Question 6 (continued) 


Section 1 is scored as follows: 


Essentially correct (E) if the response includes the following five components: 


‘i 
Li 


2: 


In part (a) the response addresses the positive association. 

In part (a) the response uses the positive association to justify that the scatterplot supports the 
newspaper report. 

In part (b) the response correctly identifies the numerical value of the slope from the computer 
output. 

In part (b) the response interprets the slope as the change in starting salary for each additional 
semester, in context. 

In part (b) the interpretation of slope includes nondeterministic language (e.g., “predicted 
Starting salary” or equivalent) when interpreting the slope. 


Partially correct (P) if the response includes three or four of the five components. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 


In part (a) the response can use phrases such as “positive association (correlation, 
relationship),” “increasing relationship,” or describe a positive association (e.g., “the starting 
salaries are higher when there is a greater number of semesters”) to satisfy component 1. 
However, describing the relationship between only two points does not satisfy this component. 
In part (a) comments about linearity and strength should be ignored, even if the response 
implies these are required (e.g., “Yes, because the relationship is strong, positive, and linear.”). 
In part (a) responses that answer “no,” “maybe,” “kind of,” “somewhat,” or equivalent do not 
satisfy component 2. 

In part (a) a response that says “no, because the three clusters of points each have a negative 
association” does not satisfy component 1 or component 2. 

In part (a) no context is required, but variable names are required in part (b) to satisfy 
component 4. 

In part (b) a response that states the equation y = 34.018 + 1.1594x does not satisfy 
component 3 unless the slope is specifically identified or used in the interpretation. 

In part (b) a response that incorrectly identifies the numerical value of the slope can still satisfy 
components 4 and 5 using the incorrect value. 

In part (b) the 1-unit increase in number of semesters must be stated or implied to satisfy 
component 4 (e.g., for each semester, for every semester). A response that states or implies an 
unspecified number of semesters does not satisfy this component (for example, as semesters 
increase). 

In part (b) examples of nondeterministic language include “predicted starting salary,” 
“expected starting salary,’ “estimated starting salary,’ “typical starting salary,” “average 
starting salary,’ “starting salary, on average,’ “our model predicts,” and so on. However, 
“about,” “approximately,” and “according to the model” do not satisfy component b. 

In part (b) no units are required for the change in predicted starting salary (which means it is 
OK to say 1.1594 euros, 1159.40, 1.1594, 1,159.40 dollars, and so on). 


In all parts it is acceptable if a response refers to salary rather than starting salary. 


Pn bk 
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Question 6 (continued) 
Section 2 is scored as follows: 


Essentially correct (E) if the response includes the following five components: 

In part (c) the response states that the association is negative. 

In part (c) the response states that the association is strong OR linear OR both. 

In part (c) the response refers to both variables (semesters, salary) in context. 

In part (d) the response correctly compares the three majors. 

In part (d) the response provides reasonable values for the median salaries or refers to “median 
starting salaries” when describing a characteristic of the graph. 


Ot ee 


Partially correct (P) if the response includes three or four of the five components. 
Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 

e In part (c) the response can use phrases such as “negative association (correlation, 
relationship),” “decreasing relationship,” “inversely related,” or describe a negative association 
(for example, “the starting salaries are smaller when there is a greater number of semesters”) to 
satisfy component 1. However, describing the relationship between only two points does not 
satisfy this component. 

e In part (c) responses that describe the relationship incorrectly as weak or nonlinear do not 
satisfy component 2, even if the other characteristic is described correctly. 

e Only drawing a line on the scatterplot does not satisfy component 2. 

e In part (d) aresponse that says “chemistry has the highest median starting salary and business 
has the lowest median starting salary” (or the equivalent) implies that physics is in the middle, 
and satisfies component 4. 

e In part (d) if no values are provided for the medians, the response must use the phrase “median 
starting salary” at least once to satisfy component 5. 

e Inall parts it is acceptable if a response refers to salary rather than starting salary. 


Section 3 is scored as follows: 


Essentially correct (E) if the response states that there is a negative association for each of the majors 
AND the response notes the overall positive association. 


Partially correct (P) if the response states that there is a negative association for each of the majors 
BUT does not note the overall positive association: 

OR 
if the response states that there is a negative association for one or two specific majors (for example, 
for business majors) AND the response notes the overall positive association. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 
e Additional ways to note the overall positive association include stating that the original report 
was correct, stating that majors that take longer to complete tend to have higher starting 
salaries, or the equivalent. 
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Question 6 (continued) 


A response that does not explicitly name the majors or refer to the three majors but makes a 
general statement such as “However, for an academic major, there is a negative association” 
satisfies the requirement to state the negative association for each of the majors. 

A response that states the original report was wrong, or incorrect, or should be retracted, etc., 


cannot satisfy the requirement to note the overall positive association. However, a response 
that says the original report might be misleading (or the equivalent) but still notes the overall 
positive association satisfies the requirement to note the overall positive association. 
In all parts it is acceptable if a response refers to salary rather than starting salary. 
Complete Response 
All three sections essentially correct 


Substantial Response 


Two sections essentially correct and one section partially correct 


OR 

sections 1 and 2 partially correct and section 3 essentially correct 
OR 

sections 1 and 3 essentially correct and section 2 incorrect 
OR 


sections 2 and 3 essentially correct and section 1 incorrect 
Developing Response 


sections 1 and 2 essentially correct and section 3 incorrect 


OR 

section 1 essentially correct and sections 2 and 3 partially correct 
OR 

sections 1 and 3 partially correct and section 2 essentially correct 
OR 

One section essentially correct and one section partially correct 
OR 


All three sections partially correct 
Minimal Response 


One section essentially correct and two sections incorrect 
OR 

Two sections partially correct and one section incorrect 
OR 

One section partially correct and two sections incorrect 
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